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ABSTRACT 

We use data from the DEBRIS survey, taken at 100 jJ.m with the PACS instrument on board 
the Herschel Space Observatory, to make a cosmic variance-independent measurement of 
the extragalactic number counts. These data consist of 323 small area mapping observations 
performed uniformly across the sky, and thus represent a sparse sampling of the astronomical 
sky with an effective coverage of ^2.5 deg 2 . 

We find our cosmic variance independent analysis to be consistent with previous counts 
measurements made using relatively small area surveys. Furthermore, we find no statistically 
significant cosmic variance on any scale within the errors of our data. Finally, we interpret 
these results to estimate the probability of galaxy source confusion in the study of debris 
discs. 
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1 INTRODUCTION 

One of the most fundamental measurements that can be made from 
extragalactic survey data is that of source number counts. In most 
cases a balance between survey area and depth must be struck, typ- 
ically resulting in a small number of relatively deep compact fields 
being observed alongside some wider shallow imaging. For exam- 
ple, the largest area surveyed using Hersche^ jPilbratt et alfeoiol) 
was the Astroph ysical Terahertz Large Area Survey (H-ATLAS; 
Ealesetal which covers 550 deg 2 but still constitutes only 

1.3 per cent of the entire sky. In addition, the H-ATLAS observa- 
tions are relatively shallo w, with deeper su rveys covering increas- 
ingly smaller areas (e.g. lElbaz et alj|201 if) . Consequently almost 
all source count measurements are potentially subject to cosmic 
variance: the statistical uncertainty inherent when inferring results 
based on data from a finite sub-region or sub-regions of the sky. 
Only by observing the entire sky can region-to-region fluctuations 
be averaged out entirely, giving a measurement truly free from cos- 
mic variance. 

In this letter we use multiple small-field observations 
(~28 arcmin 2 each) distributed randomly across the entire sky to 
obtain a cosmic variance independent assessment of the extragalac- 
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tic source number counts, and characterise the cosmic variance on a 
wide range of angular scales. This is achieved by treating each field 
as representative of the larger region which it inhabits. By calculat- 
ing the source counts for all of these fields at once, we can obtain 
a measurement which is representative of the entire sky, in which 
cosmic variance is dramatically reduced compared to contiguous 
area surveys of a similar size. By producing source counts from 
various sub-combinations of these fields, we can also estimate the 
cosmic variance - quantitatively - as a function of angular scale, 
thereby determining the scale at which a survey can be regarded as 
effectively free from cosmic variance. 

For this analysis we use the 100-/im data from the Disc Emis- 
sion via a Bias-free Reconnaissance in the Infrared/Submillimetre 
survey (DEBRIS; Matthews et al., in preparation), obtained with 
the Phot oconductor Array Ca mera and Spectrometer instrument 
(PACS - |PoglitschetalJl2010h on board Herschel. We do not in- 
clude an analysis of the 160-flm DEBRIS data, obtained at the 
same time as the 100-flm data, as their quality across the fully 
mapped area was insufficient to achieve a useful measurement of 
the source counts. 

This analysis is not intended to supersede t he results in this 
band from th e PACS Extragala ctic Probe (PEP - iLutz et alj|201lh 
presented bv lBerta et al .1 d2010h . who use sign ificantly deeper ob- 
servations, or the H-ATLAS results given bv lRigbvetal. l f20ll 
and Ibar et al. (in preparation), who use a significantly larger to- 
tal area. Instead this analysis focuses on the comparison of these 
results with those obtained from our cosmic variance indepen- 
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dent measurement of the source counts. Various clustering anal- 
yses have already been performed at Herschel wavelengths (e.g . 
ICoorav et alj20ld : lMaddox et al.l201(il ; lMagliocchetti et al.l201lb . 
however, these have all been restricted to relatively small scales, 
with the largest clustering analysis at 100 flm being at angular 
scales ^ 6 arcmin. Consequently we aim to provide a complemen- 
tary analysis, focusing on much larger angular scales. 

In §[2] we describe the data from which we obtain our source 
counts, along with our reduction method. §[3] contains a descrip- 
tion of our source extraction method, and the simulations we per- 
formed to quantify completeness. Our number counts and measure- 
ments of cosmic variance as a function of angular scale are given 
in §|4] These results are then interpreted to assess the impact of 
background source confusion on general debris disc surveys in §[5] 
Finally, our conclusions are given in §|6] 



2 THE DATA 

The DEBRIS survey is a Herschel Open Time Key Programme 
whose primary science goal is to discover and study debris discs 
around nearby stars (Matthews et al., in preparation). The target 
list is generally unbiased, with sources only being excluded if the 
cirrus confusion noise level at 100 flm was predicted to be greater 
than 1 .2 mJy for a point source at the time of survey design. The tar- 
get selection was ta ken from the Unbiased Nearby Star catalogue 
dPhillips et alj20ld) with sources being selected sequentially in or- 
der of distance from the Sun. With the exception of regions close 
to the Galactic plane where cirrus confusion is high, the observed 
fields are distributed uniformly on the sky, with no particular pref- 
erence in any one direction. The flux limited observing strategy 
employed by the DEBRIS team also ensures that each region is 
observed to the same depth, thereby providing a uniform data set. 

In this work we search for chance detections of extragalac- 
tic sources in the extended imaging around 323 debris disc tar- 
get fields. The observations were made using the PACS 'mini- 
scan map' observing mode, which provides maps with an area 
of approximately 3.5 arcmin x 8 arcmin, giving a total survey 
area of approximately 2.5 deg 2 . The telescope scanning rate was 
20 arcsec s -1 , and the full width half maximum of the resultant 
100- flm beam was 7 arcsec. This observing mode was designed to 
observe compact and point-like sources, with the majority of the 
integration time concentrated in the centre of the map. 

The maps were reduc ed using the Herschel Interactive 
Pipeline Environment (HIPE; I5"ttll20 1 Oh version 7.0. The standard 
processing steps were followed and maps were made using the 
'photProject' task. The data were high-pass filtered using a filter 
scale of 16 frames, equivalent to 66 arcsec, to suppress 1// noise. 
The final maps have a l-a noise level of ~2.0mJy beam -1 in the 
central region. 



3 SOURCE CATALOGUE AND SIMULATIONS 

Before finding and extracting sources we adjusted the maps to re- 
move any pixels whose integration time was below 25 per cent of 
the peak map coverage. This was done to remove the noisy edge 
regions which would confuse a source-extraction algorithm. A cir- 
cular mask with a radius of 45 arcsec was also applied at the map 
centre to blank out the original target star coordinates for each field. 

We identified so urces in the maps using t he source-extraction 
software, SExtractor jBertin & Arnoutslll996t) , and measured the 



flux density using aperture photometry. An aperture radius of 
20 arcsec was used for all sources. Custom aperture corrections 
were derived for this dataset with reference to the aperture correc- 
tions provided by the PACS instrument team. These custom correc- 
tions were applied to each measured source to obtain a final flux 
density measurement. 

The fragmented nature of this survey and the highly variable, 
irregularly shaped coverage within individual maps means it is dif- 
ficult to determine an accurate survey area and general noise level 
for this work. To overcome this issue we chose to adopt a refer- 
ence effective area with a regular shape for each field. The refer- 
ence area is required to be larger than the range of the measured 
data and in practice this was simply taken as the total area of the 
image, including all regions for which no data were obtained. A 
completeness correction could then be determined based on simu- 
lations performed within this reference area. As the reference area 
is required to be sufficiently large to enable all position angles of 
the rectangular map, a significant portion of the reference area will 
always contain no data, meaning that the maximum completeness 
possible is ~ 20 per cent. 

To obtain the completeness correction for these data we placed 
ten simulated point sources of a given flux density randomly into 
each map. We used observations of Arcturus (observation IDs 
1342188248 and 1342188249) - obtained in the same observing 
configuration as our data - as our point source model. This model 
was scaled in order to return the expected flux density following ap- 
plication of the appropriate aperture correction. No more than ten 
sources were input to a single map to avoid increasing the source 
density to a point wherein the simulated sources might blend and 
represent an increase in confusion noisfl). These simulated sources 
were extracted using the same method as for our original catalogue. 
The number of simulated sources detected was then obtained by 
differencing histograms constructed from the original and simu- 
lated source catalogues. The resultant histogram showed a Gaus- 
sian distribution of sources centred at the flux density of the input 
(simulated) source. A Gaussian fit was made to this histogram and 
the number of sources measured was compared to the number of 
simulated sources to obtain the completeness at this flux density. 
The fitted width of the Gaussian was also used as a measure of the 
uncertainty in flux-density measurement for all measured sources. 
This process was repeated for a range of flux densities to find the 
completeness correction across the flux densities measured in our 
original catalogue. The entire simulation process was repeated five 
times to reduce the uncertainty in the derived correction. 

The completeness correction is a statistical result and is only 
valid when working with the entire survey dataset and the adopted 
reference map areas. It is not applicable to specific individual re- 
gions of real data within any given map. The final DEBRIS galaxy 
source catalogue consists of 540 sources with a detection signifi- 
cance of ^ 3o~ and a typical l-a uncertainty of ~ 3.0 mJy. The typ- 
ical completeness for this catalogue is 14 per cent, which equates 
to ~65 per cent within the regions of the reference area (described 
above) in which data exist. The large range of integration times 
within the map means that completeness levels near 100 per cent 
could only be achieved for the extremely rare, bright sources. 
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Table 1. Measured extragalactic number counts, normalised to the Eu- 
clidean slope at 100 jlm 
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Figure 1. Galaxy source counts at 100 fim, normalised to the Euclidean 
slope. Source cou nts in various well-known survey fields from the PEP 
iBerta et alj|20ld) and H-ATLAS iRigbv et alj|201 ll) surveys are overlaid 
for comparison. 



4 NUMBER COUNTS AND COSMIC VARIANCE 
4.1 Number counts 

The measured galaxy number counts at 100 flm, normalised to the 
Euclidean slope, are given in Table Q] and plotted in Fi gure [T] The 
count s obtained at th e same wavelength from the PEP l lBerta et all 
l201fj) and H-ATLAS dRigbv etal.l201ll) surveys are also given, for 
comparison, with the three PEP counts being obtained from 2-deg 2 , 
450-arcmin 2 and 140-arcmin 2 fields fo r the Cosmic Evolution Sur- 
vey (COSMOS - IScoville et"ai]|2007l) . Lockman Hole and Great 
Obse rvatories Origins Deep Survey (GOODS-N - iGiavalisco et al.l 
|2004|) fields, respectively, and the H-ATLAS counts from the 14- 
deg 2 science demonstration field. Poisson noise provides the domi- 
nant uncertainty in the DEBRIS number counts, with the Eddington 
bias uncertainty ranging from ~90 to ~50 per cent of the corre- 
sponding Poisson noise level from low to high flux densities across 
the plotted range. The errors quoted are the quadrature sum of these 
two sources of error. 

Our number counts are in good agreement with those pub- 
lished by the PEP and H-ATLAS teams, indicating that the re- 
sults from these individual fields are genuinely representative of the 
larger-scale galaxy number counts within the given uncertainties of 
the relative samples. 
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Figure 2. Measured cell variance (top) and upper limits (bottom) as a func- 
tion of cell radius. Note that negative o~ indicates that the measured vari- 
ance is less than that expected from Poisson statistics. 



4.2 Cosmic variance 

To measure cosmic variance on various angular scales we began 
with the null hypothesis that there was zero cosmic variance on 
all scales (cr 2 = 0) . This hypothesi s was t hen tested using the for- 
malism set out by lEf stathiou et all d 1990h . The sky was split into 
circular cells of equal angular area, A, and galaxy number counts, 
N, were obtained for each cell. The cell-to-cell variations were then 
compared with wha t would be expected fr om Poissonian statistics 
using Equation 9 of lEfstathiou et al.l dl990h . 



Lj(Nj -AjZkNk/ZkAk) 2 - (1-1* 



K/(LkA k ) 2 



(LjNj/i:jAjy-\Z k A 2 k -2^ k Al/Z k A k 



(LkAi) 2 /(LkA, 



k n 
(i) 



This generalised equation allows for incomplete cell sampling 
and is therefo re appropriate for the DEBRIS sample (see 
lEfstathiou et al.l for details). The uncertai nty in this measuremen t 
(A(<T 2 )) was quantified using Equation 5 of Efstathiou et al jl 1990h . 
This method was also implemented bv lAustermann et al]d20ld vto 
assess the level of cosmic variance in four fields imaged with the 
AzTEC camera at 1 . 1 mm. 

Cells with an angular area equal to that of one DEBRIS map 
up to 2k steradians were tested. The measured variance from each 
cell size is shown in Figure [2] The measured variances have a 
median value of 1 .2 x 1 0~ 2 , in keeping with those reported by 
lAustermann et al ] d2010l) as well as the model of iMoster et al .1 
1 201 ll) . within the stated errors. However, in all cases the signal- 
to-noise ratio (rj 2 /A(rj 2 )) is less than 3. This indicates that these 
measurements do not represent a statistically significant detection 
of a non-zero cosmic variance on scales down to the size of one 
DEBRIS map. 

Although the non-uniformity and sparse sampling of the DE- 
BRIS data makes it difficult to calculate the angul ar correlation 
function for direct compari son with the results of ICoorav et al .1 
d20ld) . iMaddox et al.l d201oT) and iMagliocchetti et all d2011l) . it is 
still possible to compare their results with those from this analy- 
sis. All three prior publications show a decrease in clustering (i.e. 
variance) with increasing angular scale. The upper limits we obtain 
sho w a similar tren d, a lbeit w ith very low statist ical signifi cance. 

ICoorav et al] d201fj) , IMaddox et all d20ld) and 



4 B. Sibthorpe et al. 



iMagliocchetti et all d201lf) also find increased clustering with 
higher redshift galaxy populations. Therefore our non-detection of 
cosmic variance at 100 flm, which is typically dominated by low 
redshift (z ^ 1) galaxies, is in keeping with these previous results. 

The failure to detect cosmic variance in this work could be 
thought to be, in part, a result of the DEBRIS survey depth. Deeper 
observations would probe higher-redshift galaxy populations, too 
faint to detect in the current data. With deeper maps we would 
th erefore expect to obs erve the same level of clus tering at 100 flm 
as lCooravetall fcOld) and lMaddox et alj d2010h find at 350 and 
500 flm. However, these deeper data would also contain additional 
faint, low-redshift galaxies with a similarly uniform distribution 
to the population of nearby galaxies currently observed, thereby 
diluting the clustering signature of these additional high-redshift 
sources. In addition, if we assume that the clustering measured at 
350 flm is domi nated by galaxies c lose to the 3-(7 confusion limit 
and at z ~ 2.3 dCoorav et alj|201oh then the required l-O" survey 
limit to detect these sources at 100 flm would be ~0.2mJy, only 
slightly higher than the 100-flm confusion limit. Given these two 
factors, we conclude that our results are valid for all 100-flm sur- 
vey depths down to the limiting survey area of one DEBRIS map, 
given a comparable confusion noise limit. Therefore, a survey of 
~28 arcmin 2 would be expected to have O" 2 ^ 0.2. 



5 RELEVANCE TO DEBRIS DISCS 

Debris discs are most commonly identified by an infrared excess 
above the predicted photospheric level. It can be extremely dif- 
ficult to determine accurately whether a measured excess is due 
to a debris disc or merely a chance alignment with a background 
galaxy. Some features originally attributed to debris discs have sub- 
sequen t ly been found to be background objects, e.g. lGreaves et al.1 
d2005l) : lRegibo et al.l J2012h . Confusion is particularly problematic 
in statistical studies of debris discs. We expect cases of background 
contamination will be more likely when analysing many tens of 
discs, as is the case with the DEBRIS survey. It is therefore helpful 
to also interpret the results of this analysis in terms of its relevance 
to debris disc studies. 

In order to account for confusion, an accurate estimate of the 
probability of a background source being present within a given 
radius of a star is essential. This study has shown that data from 
DEBRIS observations, typical of almost all debris disc observa- 
tions made with Herschel, are consistent with data from surveys 
of typical extragalactic fields. We have also shown that, to within 
the limits of these data, there is no significant cosmic variance at 
100 flm. Consequently, it is reasonable to make use of the results 
from deep extragalactic surveys and apply their statistics directly 
to debris disc observations. For the purposes of estimating back- 
ground source confusion, we make the assumption that the invari- 
ance of the 100-fim counts can be extended to the other two PACS 

wavelength bands. 

Using the raw measured differential counts of iBerta et al] 

J201 1[), we estimated the number of sources that could potentially 
explain an excess at a given flux density. To ensure all potential 
sources were included, the number counts were weighted by a 
Gaussian probability density function with a l-O" width equal to 
the uncertainty of the original measurement. This provides a mea- 
surement that includes a contribution due to Eddington bias. We 
cal culated the numbe r of galaxies for all flux density bins given 
bv lBerta et all J20T l|) over a wide range of O" values, then fitted a 
two-dimensional polynomial to the output grid. lMagliocchetti et al.1 



Table 2. Probability, calculated using Equation[2] of one background source 
existing within a beam half-width half-maximum radius of the measured 
source loaction for a range of measured exccess flux densities. In all cases 
the excess emission is assumed to be detected at the 3-0" level. 
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0.12 
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J20T 1) do find clustering in these data on small scales, but the level 
of clustering is extremely low, and none of the measurements is 
statistically significant (< 3o" in all cases). As a result, we regard 
these data to be unclustered for the purposes described in this sec- 
tion. The number of galaxies per square degree, N, at a given flux 
density, Sx , and measurement uncertainty, O", can thus be estimated 
directly from the resultant polynomial fit. 



log 10 /V= £ £(2), v log 10 o- ! log 10 S{, 

U=0 



(2) 



where k(X)fj is the matrix of polynomial coefficients derived 
for the fit. These matrices for flux densities measured in mjy for 
each wavelength are, 



£(70 flm) 



fc(lOOflm) 



fc(160fim) 



3.34 
-2.45 x 10~ 4 
4.41 x 10~ 6 

3.84 
-0.0108 
7.94 x 10~ 5 

4.12 

-0.0208 
1.00 x 10~ 4 



-0.770 
-2.23 x 10~ 3 
-2.31 x 10~ 6 

-1.10 
0.0340 
-2.58 x 10~ 4 

-1.04 
0.0455 
-2.35 x 10~ 4 



-5 



-0.470 
5.23 x 10~ 3 
-1.89 x 10 

-0.181 
-0.0180 
1.44 x 10" 4 

-0.142 
-0.0210 
1.17 x 10~ 4 



Therefore, given an expected background source number density 
N(Si,(j), the Poisson probability of n sources existing within ra- 
dius r (in arcsec) is given by, 



P(n, < r,N) = exp 



Nnr 2 \ (A?7rr 2 /3600 2 )" 
3600 2 ) n\ ' 



(3) 



Using this formalism it is possible to calculate the probability of 
a background source being present in PACS observations of debris 
discs with an excess ranging from ~2-140mJy. Table [2] gives the 
probability of a source being present within a beam half-width half- 
maximum radius of a given source location, for a range of typical 
levels of measured excess flux density. In all cases the source is 
assumed to have been detected at the 3-0" level. These values have 
been calculated using Equation [2] and have been found to have a 
l-O" variation of 2.6 x 10~ 3 with respect to probabilities calculated 
directly from the original data. 

These results are suitable when estimating the confusion of an 
individual source with a clearly measured excess. However, when 
working with a survey consisting of many observations, looking at 
the probability of confusion at a specific flux density is less useful 
than asking the question what fraction of sources are likely to be 
confused by any number of background sources? 
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Figure 3. Probability of confusion by one or more background sources as 
a function of survey limiting flux density calculated via the Monte-Carlo 
method (symbols) and the naive method (dashed line). Polynomial fits made 
in log-log space to the Monte-Carlo points are plotted as solid lines, with the 
fitted equations given in the top right of each panel. In all cases the limiting 
flux density is assumed to be the 3-o~ limit for the given survey. Results are 
shown for confusion within various radii of a given source location. 



The cumulative extragalactic number counts can be used to 
estimate the probability of confusion, by one or more background 
sources, by integrating the counts from the survey limiting flux den- 
sity, S]j m , to the upper limit of the data. This number of galaxies, N c , 
can then be input to the cumulative Poisson probability equation, 

/ N c xr 2 \ " {N c 7lr 2 /3600 2 y 
e Trfff n\ ' (4) 



P c (n,<r,N c 



where P c is the probability of confusion with n or more background 
sources within a given radius r . 

Using the raw cumulative number counts, however, does not 
take account of Eddington bias. Due to the steep slope of the counts 
a larger number of sources will be boosted above S\[ m than will be 
boosted down, making this 'naive' measurement of ./V an under- 
estimate. To assess the significance of Eddi ngton bias in this case 
we used the cumulative number counts from lBerta et al.l d201ll) and 
performed a Monte-Carlo simulation. Simulations were performed 
for a range of limiting survey flux density and region radii, and the 
output is given in Figure[3] 

Results from the naive and Monte-Carlo methods are in good 
agreement for faint survey limits, but begin to diverge at limits 
greater than ~10mJy, with the naive method systematically un- 
derestimating the confusion. In these bright survey limit cases, the 
increased measurement uncertainty results in a higher contribution 
from Eddington bias and thus an underestimate in the naive esti- 
mate of N c . Note that the increase in the measurement uncertainty 
is a result of our assumption that the S\{ m is a 3-C7 limit. For higher 
survey thresholds, e.g. 5-C7, the deviation between the two methods 
would begin at higher flux densities. 



These results are applicable for surveys without a single, lim- 
iting flux density. In this case the probability of confusion for each 
observation, given its own limiting flux density, can be estimated 
and combined for all observations in the standard way to estimate 
the probability of source confusion. 



6 CONCLUSIONS 

We find our galaxy number counts - based on observations that 
sparsely sample the entire sky in a uniform manner - to be in 
agreement with previous studies whose results are derived from 
relatively small-area surveys. We also measure no statistically sig- 
nificant non-zero cosmic variance on scales of ~28 arcmin- (the 
scale of a DEBRIS map) up to 2n steradians, providing only upper 
limits. We conclude that traditional, relatively small-area surveys, 
such as those undertaken by the PEP and H-ATLAS teams, are in 
general sufficiently representative to measure and characterise the 
extragalactic number counts in a universal context. 
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